AIME 2024

mentions: 2 · type: Person

// recent coverage 2 mentions

2026-07-10 20:24 · machinebrief.com · artificial-intelligence

TREK: A New Path in AI Problem Solving

Researchers introduced TREK, a method that improves AI problem-solving by using verified output trajectories to extend model learning. TREK boosted Qwen3-8B's performance on AIME 2024 from 36.9 to 40.…

2026-04-20 00:00 · andlukyane.com · large-language-models

FIPO: Teaching LLMs Which Thoughts Actually Matter

FIPO (Future-Impact-based Policy Optimization) is a reinforcement learning method that improves LLM reasoning by assigning token-level credit based on each token's future impact on the policy, rather …

// co-occurs with top 8 entities

FIPO (1), Qwen2.5-32B-Base (1), DAPO (1), VeRL (1), TREK (1), Qwen3 (1), AIME 2025 (1), ALFWorld (1)